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Abstract. We design the first efficient algorithms and prove new combinatorial bounds for 
list decoding tensor products of codes and interleaved codes. 

• We show that for every code, the ratio of its list decoding radius to its minimum distance 
stays unchanged under the tensor product operation (rather than squaring, as one might 
expect). This gives the first efficient list decoders and new combinatorial bounds for 
some natural codes including multivariate polynomials where the degree in each variable 
is bounded. 

• We show that for every code, its list decoding radius remains unchanged under m-wise 
interleaving for an integer m. This generalizes a recent result of Dinur et al. [6], who 
proved such a result for interleaved Hadamard codes (equivalently, linear transforma- 
tions). 

• Using the notion of generalized Hamming weights, we give better list size bounds for both 
tensoring and interleaving of binary linear codes. By analyzing the weight distribution 
of these codes, we reduce the task of bounding the list size to bounding the number of 
close-by low-rank codewords. For decoding linear transformations, using rank-reduction 
together with other ideas, we obtain list size bounds that are tight over small fields. 

Our results give better bounds on the list decoding radius than what is obtained from 
the Johnson bound, and yield rather general families of codes decodable beyond the Johnson 
bound. 



1. Introduction 

The decoding problem for error-correcting codes consists of finding the original message given 
a corrupted version of the codeword encoding it. When the number of errors in the codeword is 
too large, and in particular could exceed half the minimum distance of the code, unambiguous 
recovery of the original codeword is no longer always possible. The notion of list decoding 
of error-correcting codes, introduced by Elias [10] and Wozencraft [35], provides one avenue 
for error-correction in such high noise regimes. The goal of a list decoding algorithm is to 
efficiently recover a list of all codewords within a specified Hamming radius of an input string. 
The central problem of list decoding is to identify the radius up to which this goal is tractable, 
both combinatorially (in terms of the output list being guaranteed to be small, regardless of 
the input) and algorithmically (in terms of being able to find the list efficiently). 

The classical Johnson bound shows that at least combinatorially, list decoding always allows 
one to correct errors beyond half the minimum distance. It states that every code of distance 
5 over Fg is list-decodable up to the Johnson radius Jq{5) which lies in the range (5/2,(5]. 
However, the Johnson bound is oblivious to the structure of the code; it only depends on its 
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minimum distance. Potentially, a code might be list-decodable at larger error-radii than what 
is guaranteed by the Johnson bound. The question of identifying the precise radius up to 
which list decoding is tractable for a family of codes is a challenging problem. Despite much 
progress in designing list decoding algorithms over the last decade, this problem is still open 
even for well-studied codes such as Reed-Solomon and Reed-Muller codes. 

On the algorithmic side, following the breakthrough results of Goldreich-Levin [11] and Sudan 
[28] which gave list decoders for Hadamard codes and Reed-Solomon codes respectively, there 
has been tremendous progress in devising list decoders for various codes (see the surveys 
[15, 16, 29]). This study has had substantial impact on other areas such as complexity theory 
[30, 32], cryptography [11, 1] and computational learning [19, 21]. Examples of codes which 
are known to be list-decodable beyond the Johnson bound have been rare: Extractor codes 
[31, 14], folded Reed-Solomon codes [24, 17], group homomorphisms [6] and Reed-Muller codes 
over small fields [12] are the few examples known to us. 

A natural way to design new error-correcting codes from old ones is via various product op- 
erations on these codes. In this work, we study the effect of two basic product operations, 
tensoring and interleaving, on list-decodability. In what follows, [q] stands for an alphabet of 
size q, for example {0, 1, . . . , g — 1}. 

Definition 1.1. (Tensor Product) Given two linear codes Ci C [g]"-! and C2 ^ [9]"^, their 
tensor product C2'S>Ci consists of all matrices in [g]"2^'^i whose rows belong to Ci and columns 
belong to €2- For a code C C [q]"", its m-wise tensor product for m ^ 1 is a code of length 
defined inductively as C^^ = C and C®'" = O C for m > 1. 

For example, Reed-Muller codes in m variables where the degree in each variable is restricted 
to d can be viewed as the m-wise tensor of Reed-Solomon codes. Our algorithm does not 
require the CjS to be linear, but we make the assumption since the tensor of two non-linear 
codes might be empty. Using 6{C) and R{C) to denote the normalized distance and rate 
of C respectively, it follows that (5(C®™) = (5(C)™ and i?(C®™) = i?(C)™. Hence for tensor 
products, we are primarily interested in the setting where m is either constant or a slowly 
growing function of the block length. 

Definition 1.2. (Interleaved Codes) The m-wise Interleaving (or interleaved product) C®"^ 
of the code C C [g]" consists of n x m matrices over [q] whose columns are codewords in C. 
Each row is treated as a single symbol, thus C®"^ C [g™]". 

In other words, under m-wise interleaving, m independent messages are encoded using C, and 
the m symbols in each position are juxtaposed together into a single symbol over a larger 
alphabet. For instance, linear transformations from F2 can be viewed as interleaved 

Hadamard codes. It is easy to see that 5(C®'") = 6{C), i?(C®"") = R{C) but the alphabet 
grows from [q] to m-dimensional vectors over [q]. So unlike for tensors, for interleaving m 
could be as large as polynomial in the block length; indeed our results hold for any m. 

1.1. Tensor products and interleaved codes: Prior work and motivation. 

1.1.1. Tensoring. Tensor products occupy a central place in coding theory, so much so that 
tensor product codes are typically referred to as just product codes. Product codes provide 
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a convenient way to construct longer codes from shorter component codes. Elias [9] used 
tensor product of Hamming codes to construct the first exphcit codes with positive rate for 
communication over a binary symmetric channel. The structure of product codes enables de- 
coding them along columns and rows, and the column decoder can provide valuable reliability 
information to the row decoder [26]. Product codes find many uses in practice; for example, 
the product of two Reed-Solomon codes is used for encoding data on DVDs. 

More recently, tensor products have found applications in several areas of theoretical CS such 
as hardness of approximation [8, 20] and constructions of locally testable codes [22]. The 
effect of tensoring on on local testability of codes has been extensively studied [2, 7, 34, 4]. 
Tensor products admit a natural tester (which checks a random row/column) that has a 
certain "robustness" property [2]. Exploiting this, by recursive tensoring one can obtain 
simple constructions of locally testable codes with non-trivial parameters, starting from any 
reasonable constant-sized code. But to our knowledge, there seems to no prior work focusing 
on the effect of tensoring on the list decoding radius. In particular, the best combinatorial 
bound known for the list decoding radius seems to have been the Johnson bound, and we are 
unaware of an efficient algorithm that decodes C2 Ci up to the Johnson bound, assuming 
that such algorithms exists for each Cj. A sufficiently strong result on list decoding tensor 
products could lead to a simple, recursive construction of list-decodable codes starting from 
any small code of good distance. Our results are optimal in terms of the radius to which we 
decode, but not strong enough in terms of output list-size guarantee to obtain such a result. 

1.1.2. Interleaving. At a high level, interleaving is a way to arrange data in a non-contiguous 
way in order to increase performance. Interleaving is used in practical coding systems to group 
together the symbols of several codewords as a way to guard against burst errors. A burst 
error could cause too many errors within one codeword, making it unrecoverable. However, 
with interleaving, the errors get distributed into a small, correctable number of errors in each 
of the interleaved codewords. This is quite important in practice; for example the data in a CD 
is protected using cross-interleaved Reed-Solomon coding (CIRC). Though code interleaving 
has been implicitly studied for its practical importance, our work appears to be the first to 
study it in generality as a formal product operation. We describe some recent theoretical work 
on interleaved codes which set the stage for our investigation. 

The problem of decoding interleaved Reed-Solomon codes from a large number of random 
errors was tackled in [5, 3]. The folded Reed-Solomon codes constructed by Guruswami 
and Rudra [17] (which achieve list decoding capacity), and their precursor Parvaresh-Vardy 
codes [24], are both sub codes of interleaved Reed-Solomon codes, where the m interleaved 
codewords are carefully chosen to have dependencies on each other. Dinur et al. in their 
work on list decoding group homomorphisms studied interleaved Hadamard codes, which are 
essentially all linear transformations from [6] . Their work raised the question of how 

the interleaving operation affects the list decoding radius of arbitrary codes, and motivated 
our results. 

1.2. Brief high-level description of key results. We start with some terminology. 
The distance dist(C) of a code C C [g]" is the minimum Hamming distance between two 
distinct codewords in C, and its relative distance is defined as dist(C)/n. For a code C of block 
length n and < < 1, the list-size for radius 1], denoted i{C, rj), is defined as the maximum 
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number of codewords in a Hamming ball of radius ryn. Informally, the list decoding radius 
(LDR) of C is the largest 77 such that for every constant e > 0, the list size for radius (ij — e) 
is bounded by some function f{q,£) independent of the block length n.^ 

Our main result on tensor products is the following: if C has relative distance 5 and LDR ij, 
then the list decoding radius of the m-wise product C^"^ is r/5™~^. In other words, the ratio 
of LDR to relative distance is preserved under tensoring. 

For interleaved codes, we prove that the LDR remains unchanged irrespective of the number 
of interleaves. In particular, if C has relative distance 6, and C®"* is its m-wise interleaving, 
then for every i] < 6 one has rj) ^ A- (.(C, rf)^ where A, B are constants depending only 

on 5, 7] and independent of m. 

Organization. Formal statement of all the results of this work appear in Section 2. These 
include the results described above which apply to arbitrary codes, along with improved list 
size bounds for special cases like binary linear codes and transformations. We present our 
bounds for interleaved codes in Section 3, and our decoding algorithm for tensor products 
together with a sketch of its analysis in Section 4. The formal analysis of the tensor decoding 
algorithm appears in Section 5. In Section 6, we use the notion of generalized Hamming 
weights to derive improved list-size bounds for tensor products and interleavings of binary 
linear codes. Finally, in Section 7, we show list size bounds for linear transformations that are 
tight over small fields. 

2. Our Results 

2.1. List decoding tensor products. Given two codes Ci C [q]'^'^ and C2 C [q]'^'^, their 
tensor product C2(8>Ci consists of all matrices in [q'j'^^xni -^^j^ose rows belong to Ci and columns 
belong to C2. 

We design a generic algorithm that list decodes C2 Ci using list decoders for Ci and C2 as 
subroutines, and bound the list-decoding radius by analyzing its performance. Further, if we 
have efficient list decoders for Ci and C2, then we get an efficient algorithm for list decoding 
C2® Ci. A brief overview of this algorithm is given below in Section 2.1.2. Our main result 
on list decoding tensor products is the following. 

Theorem 2.1. LetCi C andC2 ^ be codes of relative distance 61 and 62 respectively, 
and < ?7i, '/]2 < 1- Define r]* = mm{6iri2, S2'r]i) . Then 

In particular, if the LDR of Ci,C2 are r]i,rj2 respectively, then the LDR 0/C2 C?) Ci is at least 

7] . 

The decoding radius achieved in Theorem 2.1 is in fact tight: it is easily shown that assum- 
ing that C cannot be list decoded beyond radius r/, C®^ cannot be list decoded beyond 5r] 
(Lemma 4.1). As a corollary, the ratio of the list decoding radius to the relative distance stays 
unchanged under repeated tensoring. 
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Corollary 2.2. If C has relative distance 6 and list decoding radius rj, then the list decoding 
of the m-wise tensor product C®"^ is r]S"^~^ . 

The bounds that we get on Ust-size for are doubly exponential in m. Improving this 
bound to singly exponential in m, say exp(0(m)), could have interesting applications, such as 
a simple construction of list-decodable codes by repeated tensoring, with parameters strong 
enough for the many complexity-theoretic applications which currently rely crucially on Reed- 
Solomon list decoding. We are able to obtain some improvements for the case of tensoring 
binary linear codes; this is described in Theorem 2.8. 

Comparison to Johnson bound. Even assuming that each of Ci and C2 are decodable only 
up to the Johnson bound, Theorem 2.1 gives a bound that is significantly better than the 
Johnson bound, since by convexity Jq{6i62) < mm{5iJq{52),S2Jq(5i)) ^ r]*. 

2.1.1. Implications for natural codes. Theorem 2.1 gives new bounds on list decoding radius 
for some natural families of codes, which we discuss below. 

Reed-Solomon tensors. Let RS[n, A;]^ denote the Reed Solomon code consisting of evalua- 
tions of degree k polynomials over a set S* C of size n, with distance 6 = 1 — k/n. Such codes 
are list-decodable up to the Johnson radius J{6) = 1 — \/l — 6 using the Gur us wami- Sudan 
algorithm [18]. The m-wise tensor product of such a RS code is a [n"^,k'^]q code consisting 
of evaluations on 5"^ C F^, of multivariate polynomials in m variables with individual degree 
of each variable being at most k. Parvaresh et al. considered the problem of decoding these 
product codes, and extended the Reed-Solomon list decoder to this setting [23]. This yields 
relatively weak bounds, and as they note, by reducing the problem to decoding Reed-Muller 
codes of order mk, one can do better [25]. Still, these bounds are weaker than the Johnson 
radius J {5^) of the m-wise product, and in fact become trivial when m > n/k. Our results 
give much stronger bounds, and enable decoding beyond the Johnson bound. 

Corollary 2.3. The m-wise tensor of the RS[n,A;]g Reed-Solomon code is efficiently list- 
decodable up to a radius 6"^~^J{6) — e, where 5 = 1 — kjn is the relative distance o/RS[n, k]q, 
and e > is arbitrary. 

One can compare this with the Johnson bound J{6"^) by noting that 

Hadamard tensors. Let Had be the [q^,k]q Hadamard code, where a S F^ is encoded as 
the vector {a ■ x}^^fk. C is list-decodable up to its relative distance 6 = 1 — 1/q. The m-wise 
tensor product consists of all "block-linear" polynomials. Specifically, each codeword 
in C®"* is a polynomial P on m x k variables given by x = {x^^\x^'^\ . . . ,x^"^^) where each 

x^*) = {xi \ . . . ,x^j^^), such that for each i, P is a linear function in rr^*^ for each fixing of the 
other variables. 

Corollary 2.4. The m-wise tensor of [q^,k]q Hadamard codes has list decoding radius equal 
to its relative distance (l — i)"^. 
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This result is interesting in light of a conjecture by Gopalan et al. stating that Reed-Muller 
codes of degree m over are list-decodable up to the minimum distance (they proved this 
result for q = 2) [12]. Our result shows m-wise Hadamard tensors which are a natural sub code 
of order m Reed-Muller codes (with better distance but lower rate) are indeed list-decodable 
up to the minimum distance. 

2.1.2. Tensor decoder overview. Our algorithm for list decoding C2 ®Ci starts by picking small 
random subsets S C [712] and T C [ni] of the rows and columns respectively. We assume that 
we are given the codeword restricted to 5* x T as advice. By alternately running the row and 
column decoders, we improve the quality of the advice. We show that after four alternations, 
one can recover the codeword correctly with high probability (over the choice of S and T). 
An obstacle in decoding tensor product codes is that some of the rows/columns could have 
every high error-rates, and decoding those rows/columns of the received word gives incorrect 
codewords. We show that the advice string allows us to identify such rows/columns with good 
probability, thus reducing the problem to decoding from (few) errors and (many) erasures. 
The scheme of starting with a small advice string and recovering the codeword via a series of 
self-correction steps has been used for list decoding Hadamard codes and Reed-Muller codes. 
Our work is the first (to our knowledge) that applies it outside the setting of algebraic codes 
defined over a vector space. 

2.2. Interleaved Codes. Armed with a list decoding algorithm for C, a naive attempt at 
list decoding C®™ would proceed as follows: List decode each column of the received word 
separately to obtain m different lists {Li, . . . ,Lm}, then iterate over all matrices with first 
column from Li, second column from L2, etc., and output those close enough to the received 
word. The naive algorithm described above yields the following simple product bound 



This upper bound is unsatisfactory since even if i{C,r]) = 2, the upper bound on l{C®^,ri) is 
2™. Recent work of Dinur et al. [6] overcame this naive product bound when the codes being 
interleaved arise from group homomorphisms. To this end, they extensively used properties 
of certain set systems that arise in the context of group homomorphisms. 

Surprisingly, we show that the product bound can be substantially improved for every code 
C. In fact, the list size bound we obtain is independent of the number of interleavings m (as 
in the above-mentioned results of Dinur et al. [6] ) . 

Theorem 2.5. Let C he a code of relative distance and let r] < 6. Define b = [^r^] and 
r = [log jr^] ■ Then, for all integers m ^ 1, we have 



The implies that if C is list-decodable up to radius rj, then so is C®"*. The condition i] < 6 in 
Theorem 2.5 is necessary, as it is easily shown that ^(C®"^, 6) ^ 2™ (unless C is trivial and has 
only one codeword). 



(2.1) 



£{C,rj)^£{C^"',rj)^e{C,7]) 



m 



(2.2) 
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2.2.1. Proof technique. The proof of Theorem 2.5 rehes on a simple observation which we 
outhne below. Assume that we list decode the received word corresponding to the first column, 
to get a list of candidate codewords for that column and pick one codeword from this list. 
Rows where the first column of the received word differs from this codeword correspond to 
errors, hence we can replace those rows by erasures. Thus for the second column, some of 
the error locations are erased, which makes the decoding easier. Of course, if the codeword 
is close to the received word, then there may be very few (or no) erasures introduced. But 
we show there are only a few codewords in the list that are very close to the received word. 
Extending this intuition, we construct a tree of possible codewords for each column and show 
that the tree is either shallow or it does not branch too much. 

2.3. Better list-size bounds using generalized Hamming weights. For the case of 
binary linear codes, we are able to improve the list size upper bounds for both tensoring and 
interleaving (Theorems 2.1 and 2.5 above) using a common technique. We now describe the 
underlying idea and the results it yields. 

2.3.1. Method overview. Codewords of both interleaved and 2-wise tensor products are natu- 
rally viewed as matrices. We bring the rank of these matrices into play, and argue that if the 
rank of a codeword is large, then its Hamming weight is substantially higher than the distance 
of the code. It turns out that this phenomenon is captured exactly by a well-studied notion 
in coding theory called generalized Hamming weights (see the survey [33]) that is also closely 
related to list decoding from erasures [13]. The precise connection is that if a codeword of 
Q&m j^g^g rank r, then its relative Hamming weight is at least the r*'^ generalized Hamming 
weight 6r{C) of C. Similarly, rank r codewords in C (8> C have weight at least 6r{C)6{C). 

For binary codes, for r large enough, 6r{C) approaches 25. The Johnson radius of 26 exceeds 
5. Therefore, for r = r{6,r]) large enough, the number of codewords in a Hamming ball of 
radius r] < 5 whose pairwise differences all have rank > r can be bounded from above using 
the Johnson bound. Using the deletion argument from [12], the task of bounding the list-size 
for radius r] now reduces to bounding the number of rank ^ r codewords within radius rj. We 
accomplish this task, for both interleaved and product codes, using additional combinatorial 
ideas. We remark that our use of the deletion argument is more sophisticated than in [12], 
since most of the work goes into bounding the list-size for the low-rank case. 

We note that the reason the above approach does not work for non-binary alphabets is that 
the generalized Hamming weight Sr{C) may not be larger than ^^i^ for g-ary codes. 

2.3.2. Results for interleaved codes. Theorem 2.5 showed that for any code C of distance 6 and 

for any 7] < 6, the list-size for the m-wise interleaved code C®'" is bounded by i{C , rj)^^"^ 
Note that for r] ^ 5, the exponent grows without bounds. For binary linear codes, using the 
above generalized Hamming weights based approach, we can improve this bound to a fixed 
polynomial in l(C,r]), removing the dependence on log(l/(5 — rj)) in the exponent. 

Theorem 2.6. For any binary linear code with distance 6, we have: 



(2.3) 
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Given a binary linear error-correcting code C, the Johnson bound states that the hst size at 
radius J2{S) — e is bounded by 0(e~^). We can show that essentially the same list-size bound 
holds for C®*", provided the distance 6 is bounded away from ^. 

Theorem 2.7. For every 6 < there exists a constant cs such that for every binary linear 
code C of relative distance 6, 

l{C®^,U5)-e)^C5e-\ 

2.3.3. Result for tensor product. Applying the above approach to the tensor product of two 
codes, we prove the following (Theorem 6.15). Note that the list size is at most a fixed 
polynomial in the list sizes ^(Ci,r7i) and i{C2,ri2) of the original codes, instead of the quasi- 
polynomial dependence in Theorem 2.1. 

Theorem 2.8. Suppose Ci C and C2 ^ are binary linear codes of relative distance 61 
and 62 respectively. Let rji ^ Si and 772 ^52- Define rj* = mm{5ir]2, (52??i) o-n-d r = [log(^j^)] . 
Then 

iiC2 Ci,v* - e) ^ 20(^'k{Ci,viYi{C2,V2re-^'^ . 

2.4. List decoding linear transformations. Let Lin(Fq,A;,m) denote the set all linear 
transformations L : ^ F™. The code Lm{¥q,k,m) is nothing but the m-wise inter- 
leaving Had®'^(Fg, fc) of the Hadamard code Had(Fg,fc). Let £(Lm{¥q, k,m),rj) denote the 
maximum list size for the code Lm{¥q,k,m) at a distance r]. Dinur et al. [6] show that 
£(Lin(F2,A;,m),l/2 - e) ^ 0{£-'^) and ^(Lin(Fg, /t, m), 1 - l/q - e) ^ 0{e~^) for some con- 
stant C for general q. The best lower-bound known for any field is Q.{e~'^). 

Being a general result for all codes. Theorem 2.5 only gives a quasipolynomial bound for the 
special case of linear transformations. By specializing the above generalized Hamming weights 
approach to the case of linear transformations, and using more sophisticated arguments based 
on decoding from erasures for the low-rank case, we prove the following stronger bounds for 
list decoding linear transformations over F2. 

Theorem 2.9. There is a constant C such that for all positive integers k, m, 

£(Lin(F2,A;,m),^ -e) ^ Ce~^ . 

For arbitrary fields F^, we prove the following bounds, the first is asymptotically tight for 
small fields while the second is independent of q and improves on the bound of Dinur et al. . 

Theorem 2.10. There is an absolute constant C such that for every finite field ¥q, 

^(Lin(F„, k,m),l---e)^C' miufg^'e-^ e"^) . 
q 

3. Interleaved Codes 

In this section, C C [g]" will be an arbitrary code (possibly non-linear) over an alphabet [g]. 
We will use £(77) and i'^"^{r]) for i{C,r]) and i{C®"^,r]) respectively. Let dq{ci,C2) denote the 
Hamming distance between strings in [g]" and Ag(ci, C2) = dq{ci,C2)/n denote the normalized 
Hamming distance. We drop the subscript q when the alphabet is clear from context. For 
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r S [q]" , B(r, ry) C [g]" denotes the Hamming ball centered at r of radius rfn. We use C for 
codewords of C®"* and c for codewords of C. We will interchangeably view C as a matrix 
in [g]"-^™' and a vector in [g™]". For a k x m matrix A, ai, . . . ,am will denote its columns, 
a[l], . . . , a[k] will denote the rows, and A^i will denote the k x i matrix(ai, . . . , Oj). 

Given an algorithm DecodeC that can list decode C up to radius r/, it is easy to give an 
algorithm DecodeC®™ that uses DecodeC as a subroutine and runs in time polynomial 
in the list-size and m; we present this algorithm in Section 3.1. Thus it suffices to bound 
the list-size to prove Theorem 2.5. We do this by giving a (possibly inefficient) algorithm, 
which identifies rows where errors have occurred and erases them. Erasing a set S C [n] of 
co-ordinates is equivalent to puncturing the code by removing those indices. Given r £ [g]"", 
we use to denote its projection on to the co-ordinates in [n] \ S. 

Definition 3.1. (Erasing a subset) Given a code C C [g]", erasing the indices corresponding 
to 5 C [n] gives the code = {c~^ : c G C} C [g]""!-^!. 

Let |5| = iJ,n. We will only consider the case that fi < 6. It is easy to see that the resulting 
code has distance d{C^^) ^ {6 — /x)n. There is a 1-1 correspondence between codewords 
in C and their projections in . For the code , it will be convenient to consider standard 
Hamming distance, to avoid normalizing by 1 — For 77 < 1 — //, let i~^{r]) be the maximum 
number of codewords of that lie in a Hamming ball of radius rjn in 

Lemma 3.2. For any i] < 1 — fi, £~^{ri) ^ ({rj + fi). 

Proof. Take a received word r~^ G gQ ^j^^t there are L codewords c]^^ , . . . ,c~[^ 

satisfying d{r~^ ,c~^) ^ rjn. Define r G [g]" by fixing values at the set S arbitrarily. By the 
triangle inequality, d{r, Ci) ^ (r/ + /x)n, showing that l{rj + /i) ^ L. □ 

Assume we have a procedure List-Decode that takes as inputs set 5 C [n], r G [g]", an 
error parameter e and returns all codewords c G C so that d{c~^ ,r~^) ^ e (it need not be 
efficient). We use it to give an algorithm for list decoding C®"^, which identifies rows where 
errors have occurred and erases them. Assume we have fixed C^i = (ci, . . . , Cj). We erase the 
set of positions S where C^j 7^ R^i and then run a list decoder for C^^ on rj-|_i. The crucial 
observation is that since the erased positions all correspond to errors, the number of errors 
drops by |5|. The distance might also drop by |5|, but since rj < 5 to begin with, this tradeoff 
works in our favor. 



Algorithm 1. Erase-Decode 
Input: R G [g™]",??. 

Output: List C of all C G C®™ so that A^m (i?, C) ^ . 

Set 5i = (/>, /ii = . 
For i = 1, . . . , m 

1. Set £i = LiST-DECODE(5j, rj, (ry - /Xj)n) . 

2 . Choose Cj <— £j . 

3. Set S'i+i = {j G [n] s.t. C<^i[j] / fii+i = Si+i/n. 
Return C = (ci, . . . , Cm) ■ 
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In Step 2, we non-deterministically try all possible choices for a; the list C is obtained by taking 
all possible Cs that might be returned by this algorithm. Also, Cj is a codeword in C~^^ but 
we can think of it as a codeword in C by the 1-1 correspondence. Different choices for q lead 
to different sets and hence to different lists So the execution of Erase-Decode 

is best viewed as a tree, we formalize this below. 

For a received word R, Tiee{R) is a tree with m + 1 levels. The root is at level 0. A node v 
at level i is labeled hy C{v) = (ci, . . . , q). It is associated with a set S{v) C [n] of erasures 
accumulated so far which has size fi{v)n. The resulting code C~^^^^ has minimum distance 
6{v)n ^ (6 — jjL{v))n. We find all codewords in C""^^"-* that are within distance (r/ — n{v))n 
of the received word r^^^'"\ call this list C{v). By Lemma 3.2, C{v) contains at most l{ri) 
codewords. Each edge leaving v is labelled by a distinct codeword Cj+i from C{v)] it is assigned 
a weight w{ci+i) = d{Cj^_^^'"\r-_^^'"^)/n. The weight 'w{c) G [0, 1] of an edge indicates how many 
new erasures that edge contributes. Thus fj,{v) = w{ci) + • • • + w^Ci). The leaves at level m 
correspond to codewords in the list C. There might be no out-edges from v if the list C{v) 
is empty. This could result in a leaf node at a level i < m which does not correspond to 
codewords. Thus the number of leaves in Tree(i?) is an upper bound on the list-size for R. 

In order to bound the number of leaves, we assign colors to the various edges based on their 
weights. Let c be an edge leaving the vertex v. We color it White if w{c) < 6 — rj, Blue if 
■w{c) ^ 5 — rj but w^c) < and Red if w{c) ^ White edges correspond to codewords 
that are very close to the received word. Blue edges to codewords that are within the unique- 
decoding radius, and Red edges to codewords beyond the unique decoding radius. 

We begin by observing that White edges can only occur if the list is of size 1. 

Lemma 3.3. // a vertex v has a White out-edge, then it has no other out-edges. 

Proof. Assume that the edge labelled with c G >C(f) is colored White, so that d{c,r-_^^^'^) < 

{6 — r])n. Let c' be another codeword in C{v), so that d{c' ,r-_^^^^) ^ {rj — ^{v))n. Then by 
the triangle inequality, 

(i(c, c') < {5 — r])n + (r? — fi{v))n = {5 — fj,{v))n ^ 5{v)n 

But this is a contradiction since d{c,c') ^ 6{v)n. □ 

We observe that Blue edges do not cause much branching and cannot result in very deep 
paths. 

Lemma 3.4. A vertex can have at most one Blue edge leaving it. A path from the root to a 
leaf can have no more than Ij^pj] Blue edges. 

Proof. The first part holds as there can be at most one codeword within the unique decoding 
radius. Each Blue edges results in at least {6 — rj)n erasures. Therefore, after [^j^^] Blue 
edges, all rjn errors have been identified, so all remaining edges have to be White. □ 

Lastly, we show that Red edges do not give deep paths either, although a vertex can have up 
to £{t]) Red edges leaving it. 
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Lemma 3.5. A path from the root to a leaf can have no more than [log(^j3^)] Red edges. 

Proof. We claim that every Red edge leaving vertex v has weight at least {5 — ^{v))/2. Indeed, 
since c is beyond the unique-decoding radius of C~^^^\ w{c) ^ ^y^, and the relative distance 
5{v) of the code C~^^'"^ at node v satisfies 5{v) ^ {5 — fi{v))n. 

Assume now for contradiction that some path from the root to a leaf contains k red edges for 
k > [log(^4^)]. Suppose that the edges have weights pi,...,pk respectively. Contract the 
Blue and White edges between successive Red edges into a single edge, whose weight is the 
sum of weights of the contracted edges. We also do this for the edges before the first Red 
edge and those after the last Red edge. This gives a path contains 2k + 1 edges, where the 
even edges are Red, and the weight of the edges along the path are f3i, pi, f32, ■ ■ ■ , Pk, Pk+i 
respectively. Let Vi be the parent vertex of the i*^ Red edge for i G [k]. Then we have 
p{vi) = Pi and p{vi) = A + Pi-i + p{vi-i) for j ^ L But since ^ ((5 - p{vi^i))/2 and 
(3i ^ 0, we get 

5 + p{vi-i) 
Kvi) > ^ 

Now a simple induction on i proves that p{vi) ^ 6{1 — 2^"*). If we take i = \iog{j^)~\ + 1, 
then 

So when we decode at vertex Vi, all the error locations have been identified and erased. Hence 
we are now decoding from r] < 5 erasures and no errors, so the decoding is unique and error- 
free. So vertex Vi will have a single White edge leaving it and no Red edges, which is a 
contradiction. □ 

Theorem 3.6. Assume r] < 6 and let b = , f = [log ^4^1 • Then Tree(i?) has at most 

{^Vl^i'nY leaves (and hence £®"'{r]) ^ {^^,'') i{vY ) ■ 

Proof. We first contract the White edges, since they are the only out-edges leaving their 
parent nodes. This gives a tree with only Red and Blue edges. Let t{b, r) denote the 
maximum number of leaves in a tree where each path has at most b Blue and r Red edges, 
and each node have have at most one Blue edge and lijj) Red edges leaving it. So we have 
the recursion 

r) ^ t{b - 1, r) + l(j])t{b, r - I) 
with the base case tib, 0) = 1. It is easy to check that t{b,r) ^ (^+'')^(r/)''. □ 

We conclude this section by proving that the condition rj < 5 \s necessary. 
Lemma 3.7. // code C has relative distance exactly 6, then i®"^{6) ^ 2"*. 

Proof. Take ci,C2 G C where Ag(ci,C2) = S. Now take the received word in C®*" to be 
R = (ci , . . . , ci ) . For every T C [m] , let Ct to be the codeword where the i^^ column equals 
ci if i G T and C2 otherwise. It is easy to show that every such codeword is at distance at 
most 6 from R, showing that ^ 2^". □ 
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3.1. A Efficient Decoding Algorithm for C®™. Given a received word R = (ri, . . . , rm), 
for i ^ m let R^i = (ri, . . . , rj). 



Algorithm 2. DecodeC®"^ 
Input: i2G[g™]",r?. 

Output: List of all C € C®'" so that A^m (fi, C) r? . 

1 . For i = 1, . . . , m 

Set £,j = DECODEC(ri,?y). 

2 . Set >C^i = Ci . 

3 . For i = 2, . . . ,m 
For C G C<^i-i X £i. 

Add C to C<^i if Agm(C, i?<;j) ^ rj. 

4. Return C^m- 



Claim 3.8. Assume that DECODEC(r, r/) rans in time T. Then DECODEC®'"(i2, 77) returns 
a list of codewords within distance rj of R in time 0{mT + m?nl{ri)l®'^{ri)). 

Proof. For any C sucli that A{C,R) ^ r], it must hold that for every i, Agi{C^i, R^i) ^ 77. 
An easy induction on i shows that C^i G C^i, which proves the correctness of the algorithm. 

It is clear that Step 1 takes time 0{mT). To bound Step 3, we use the following simple 
observation for all i ^ m: 

Thus each iteration of the loop in Step 3 requires computing the distance between R and at 
most i[rf)i®'^{rj) candidates for C . □ 

4. List Decoding Tensor Products 

In this section and the next, Ci C F^^ and C2 C F^^ will be linear codes over Fg. Given a 
matrix A G [q']"'^^"'^, and two subsets S C [712] and T C [ni], we shall use A[S', T] to denote 
the submatrix indexed by rows in S and columns in T. Further, we will write A[S, *] instead 
of A[S^ Thus the symbol "*" when used as an index, denotes the sets [ni\ or [^2]. 

Fix a received word R G [gj^sx"-! and a codeword C G C2 (8) Ci so that 6{R, C) ^ r]* - 3e. 
The advice/guess T] to the algorithm TensorDecode consists of the values of C on a 
random submatrix S x T. Given the advice A, the TensorDecode algorithm works in four 
phases, described informally below. This is followed by a formal description of the algorithm 
in Figure 1. The reader might find Figure 2 helpful to develop intuition about the operation 
of the various phases. 

Phase 1: We run the list decoder on each row s G <S to get a new advice string B[S,-k\. If 
there is a codeword in the list that agrees with the advice ^[s,r], we set -B[s,*] to be that 
codeword (ties are broken arbitrarily; or we could declare an erasure if there isn't a unique 
choice). If there is no such codeword, we set every symbol in i?[s,*] to _L (which denotes an 
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erasure). Note that the hst for row s might contain some codeword other than C[s,*] that 
agrees with C[s, ★] on the positions in T. In such a case B[s, -k] could be incorrect. As a resuh, 
some rows in B[S,-k\ agree with the codeword C, some of them could be wrong, and the rest 
are erasures. Claim 5.3 show that with high probability, (1 — (52 + e)|5'| of the rows are correct, 
and no more than e\S\ are incorrect. 

Phase 2: Viewed columnwise, B[S,-k] gives us advice strings for the co-ordinates S of every 
column codeword. However, the advice is noisy: it is correct on {1 — 62 + s) fraction of co- 
ordinates within 5, wrong on an e fraction, and _L on the rest. But since any two codewords in 
the column code C2 are distance 82 apart, in expectation the advice string has more agreement 
with the correct codeword than any other; thus it is likely to identify the correct codeword 
from a small list of candidates. We create a new advice string D[k,-k\ by list decoding every 
column t S [ni], and selecting from the list a codeword that disagrees with B[S,t] in less 
than e fraction of co-ordinates. If no such codeword exists we set the column to _L. Claim 5.4 
shows that at least {1 — 5i + 2e) fraction of columns are correctly decoded to the corresponding 
columns of C and no more than e fraction are incorrectly decoded. 

Phase 3: Viewed row-wise, D gives an advice string for every row that is correct on at 
least {1 — 61 + e)ni co-ordinates, wrong on at most eni and blank on the rest. The advice 
though noisy is sound: since the code Ci has distance 5ini, a simple application of the triangle 
inequality shows that there is a unique codeword which disagrees with Z)[s,*] on fewer than 
erii co-ordinates, and that is the row codeword C[5, Thus the advice uniquely identifies the 
correct row codeword. We create a new received word E by list decoding each row and using 
D to identify the correct codeword in the list, and setting the row to _L if no such codeword 
exists. Claim 5.5 shows this step will find the correct codeword on (1 — 82 + 3e) fraction of 
the rows. 

Phase 4: When viewed column-wise, E gives the correct value of C on 1 — J2 + 3e fraction 
of co-ordinates, and is blank on the rest. Crucially, it does not have any incorrect symbols. 
So now we can recover C by decoding each column from erasures (note that one can uniquely 
decode C2 from less than a fraction 82 of erasures) . 

It is easy to show that the list decoding radius reached by TensorDecode is the correct one. 



Lemma 4.1. For a linear code C, i{C^'^,6r]) ^ ^(r?). 



Proof. Let r G [g]" be a received word with codewords ci, . . . ,C£ £ C within radius rj. Take cq 
to be a codeword of minimum weight 6. Define the received word r' = cq(S> r. It is easy to see 
that the codewords c • = cq 18) Q for i € [i] are all within distance 5r] from r', which proves the 
claim. □ 



Thus, if list decoding C beyond radius rj is combinatorially intractable, then so is decoding 
C®^ beyond radius 6r]. 
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Algorithm 3. TensorDecode 

Setup : Let Decodei, Decode2 denote list decoding algorithms for Ci and C2, up to error 
rates r]i and i]2 respectively. Let ii{r]i) and ^2('?2) be the upper bounds on list size output 
by Decodei and Decode2 respectively. Fix -q* = min((5ir/2, (52^i)- Let e > be a parameter 
to the algorithm. 

Input : A received word R such that 6{R,C2 Ci) ^ rj* — 3e. 
Output : A list L of ah codewords C e C2 O Ci with 6{C, R) ^ ij* - 3e. 

• Pick subsets 5 C [722] and T C [m] uniformly at random among all 
subsets of size nii and 1712 respectively. 

• For each assignment A : S x T ^ [q] 

— Phase 1 (Computing B : S x [ni] — > [q]) 
For each s £ S , 

* List decode the row i?[s,*]: £s = Decodei (i2[s, *]) . 

* Set B[s,*] to be an arbitrary codeword c in the list Cg 
satisfying c[T] = A[s,T] and B[s,*] =_L if no such codeword 
exists . 

Define Sfaii = {s £ S\B[s,*] =_L} and Ssuccess = S - Sfaii- 

— Phase 2 (Computing D : [712] x [ni] — > [q]) 
For each t € [rii] , 

* List decode the column £t = Decode2(-R[*, t]) 

* Set to be any codeword c in the list Ct satisfying 

^{c[S success], B[Ssuccessit\) < ■ Fix D[*,t] =_L if no such 

codeword exists. 
Define TfaU = {t G [ni]\D[*,t] =_L} and Tsuccess = [ni] - Tfaii- 

— Phase 3 (Computing E : [712] x [ni] [q]) 
For each s G {^^2}) 

* List decode the row i?[s,*]: £s = Decodei (i?[s, *]) . 

* Set E[s,*] to be any codeword c in the list Cs satisfying 

^{c[Tsuccess],D[s, Tsuccess]) < £"-1- Set E[s,*] =_L if no such 
codeword exists. 
Define Ujaii = {s £ [n2]\E[*,t] =_L} and (/success = ["-2] - Ufaii- 

— Phase 4 (Computing C : [712] x [rii] [q]) 

For each t £ {ni} , Unique decode the column under erasures: 

C[*,t] = UniqueDecodeErasures2(i?[*, t]) 

— Output C if 6{C,R) ^ if - 3e. 



Figure 1. List decoding algorithm for tensor product 
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Ci(ni,5i,7yi) 




C2(n2,S2,m) 

Phase II Phase III 

Figure 2. Phases of the TensorDecode Algorithm 
5. Analysis of the Tensor Product Decoder 



In this section, we will prove the correctness, analyze the list size output and compute running 
time of the TensorDecode algorithm. Towards this goal, we first present a concentration 
bound concerning sums of samples chosen without replacement. 

Lemma 5.1. Let zi, Z2, ■ ■ ■ , Zn be real numbers bounded in [0, 1]. Let S C [n] be a uniformly 
random subset of size m. Then, 



Ft 



The above concentration bound essentially a restatement of Corollary 1.1 in Serfling's work 
[27] on sums in sampling without replacement. Henceforth, for the sake of succinctness, we 
shall use the notation p{^,m) to denote the upper bound {2e~'^'^ in the above lemma. 

Firstly, we will show that for every codeword C close enough to the received word, the 
algorithm TensorDecode returns C with probability 1/4, given the right advice string 
A = C[S,T]. 
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Theorem 5.2. For a codeword C G C2®Ci within distance ij* — 3e of the received word R, the 
algorithm TensorDecode with input R, and guess A = C[S,T] returns C with probability at 
least 1 7712) - ^i(r/i)p((5i, mi)/e - £2(??2)p(e, "i2)/e- 

Proof. The argument is broken up into four parts (Claims 5.3, 5.4, 5.5, 5.6) each concerning 
a phase of the TensorDecode algorithm. □ 

5.1. Phase 1. In this phase, the algorithm constructed the advice string B[S, *] starting with 
the guess ^[5, T] and list decoding each row in S. Of the set of rows S success on which the 
decoding succeeded, some of them are decoded correctly to the corresponding row in C, while 
some are incorrect. Define sets Sr,Su, ^ Ssuccess to be the sets of rows that are decoded 
correctly and incorrectly respectively. Formally, 

Sr = {s E 5'|-B[s, *] = C[s, *]} Suj = Ssuccess Sr 

We make the following claim regarding the fraction of rows decoded correctly. 

Claim 5.3. With probability at least 1 — p{e,m2) — li{rji)p{5i,mi) / e over the choice of the 
sets S , T 

\Ssuccess\^{l-S2 + 2e)\S\ \Sr\'^{l-52 + e)\S\ \S^\^e\S\. 

Proof. Let Si denote the set of rows in S with fewer than average number of errors. Specifically, 
Si is defined as 

5i = {sgS|5(C[s,*],/2[s,*]) . 

Observe that for each s ^ Si, the codeword C[s, *] will be part of the list £s, obtained by de- 
coding the row s. Consequently, for each s G Si, B[s, *] /_L ,i.e., 5*1 C Ssuccess- We shall lower 
bound the size of Ssuccess by the size of Si. Apply Lemma 5.1 with {zi = (5(C[i, *], *])} 
and the set S. Since 5{C, R) ^ 52??i — 3e, the average of the zi is less than or equal to (52r/i — 3e. 
Thus we get 



Pr 
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5{C[S, *], *]) ^ 52r?i - 2e ^ p{e, m2 



Let us suppose 5{C[S, *], *]) ^ 82rji — 2e. By an averaging argument, for at most 82 — 2e 
fraction of the rows {C[s, *]|s G 5"}, the distance 5(C[s, *], *]) > ryi, i.e., \Si\ ^ {I — 62 + 
2e)|S'|. This immediately implies the lower bound on size of Ssuccess- 

Now we shall upper bound the probability that a row s G 5 is decoded to an incorrect 
codeword c ^ C[s,*]. Fix an s £ S and a codeword c G Cg other than C[s, *]. The codeword 
c is chosen, if and only if it agrees with the advice A[S, T] = C[S, T] on the columns in T, i.e., 

C[s,T] = c[T]. Applying Lemma 5.1 on the set of real numbers ^Zi = l|C[s,i] 7^ c[z]|| and 

the choice of the set T, we can conclude 
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C[s,T]=c[T] s^p((5i,mi) . 



By a union bound over all codewords c £ Cs, for any row s £ S, the probability of decoding 
an incorrect codeword is upper bounded by 
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C[s, *] / B[s, *] A B[s, *] ^± ^ ii{7]i)p{6i,mi 
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Hence in expectation, at most ii{rji)p{5i,mi) fraction of rows in S are decoded incorrectly, 
i.e E[|5'iu|] ^ ■^i(^i)p(<5i, JTii)!*?!. Applying Markov's inequality, we get 

Suppose \Sw\ ^ ^I'S'I, then observe that 

\Sr\ = \Ssuccess ~ 'S'lol ^ (1 ~ ^2 + • 

Thus with probability at least 1 — p{e,m2) — ii{r]i)p{6i,'mi) /e both of the assertions of the 
claim hold. □ 

5.2. Phase 2. In Phase 2, the algorithm uses the advice B[S, *] generated in the first phase in 
order to decode the columns of the matrix. Among the columns Tsuccess that are decoded suc- 
cessfully, let Tj- , Tw C Tsuccess denote the set of rows that are decoded correctly and incorrectly 
respectively. Formally, define sets T^^Tw as follows: 

Tr = {te [ni]\D[*, t] = C[*, t]} = Tsuccess - Tr 

Given that the advice B[S, *] generated in the first phase is correct on sufficiently many rows, 
the following claim suggests that at least l — 5i fraction of the columns are decoded successfully, 
of which most columns agree with codeword C. In other words, the advice *] generated 
by this phase is a near-sound advice string with almost all columns having either the correct 
value or a failure symbol _L. 

Claim 5.4. Conditioned on the event that the assertions of Claim 5.3 hold, With probability 
at least 1 — ^2{rj2)v{^if^2) / ^ over the choice of the sets S , T 

l^successl ^ (1 - 5i + 3e)ni |Tr| ^ (1 - (5i + 2e)ni \Tw\ eni 

Proof. Along the lines of the proof of Claim 5.3, let Ti be the set of columns with fewer than 
average fraction of errors. Define the set Ti as follows: 

Ti = {tG [ni]\5{C[*,t],R[*,t]) ^r,2} 

By an averaging argument, for at most 5\ — 3e fraction of the columns {C[*,t]|t G [f^i]}i the 
distance 5{C[*,t], R[*,t]) > r/2, i.e., |Ti| ^ (1 — (5i -|-3e)ni. 

Observe that for each column t € Ti, the codeword C[*,t] belongs to the list Cf By Claim 

5.3, in Phase 1, at most e\S\ rows in S were decoded incorrectly, i.e, \Suj\ ^ Hence 
the codeword C[*,t] satisfies A{C[Ssuccess,t]T ^i^successTt]) ^ s\S\. Consequently, for each 
t G Ti, -D[*,t] i.e. Ti C Tsuccess- Thus a lower bound on the size of Tsuccess is given by 
iTil ^ (l-5i + 3e)ni. 

Fix at G [ni]. Let us suppose D[*,t] is neither equal to _L or C[*, t]. Thus i] is a codeword 
in C2 such that 

A[D[S success 1 1] T C[S success it]) ^ ^\S\ 

By assertion of 5.3, 

\Sfail\ ^\S\ — \Ssuccess\ ^ (^2 — 2e)|S'| . 

From the above inequalities, we can conclude 

6{D[S,t],C[S,t]) . 
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Fix a codeword c G Ct- By the distance property of the code, we have (5(c, C[*,t]) ^ 82- 
Applying Lemma 5.1 on the set of real numbers j^j = 1 \c[i\ 7^ C[i, t]] | and the set S, we get 
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By a union bound over all codewords c ^ Ct, for a column t £ T, 
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In other words, the expected size of is at most i2iV2)p{£,iT^2)\T\. Applying Markov's 
inequality, we get: 
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^ ^2(??2)p(e,"i2)/e 



\Tw\ > eni 
To finish the argument, observe that 

|-^r| — {^success '^w\ ^ (1 '^l ~1~ 26^) 71^ • 

Thus with probability 1 — ^2(^2)^(^5 ?7T'2)/£ both of the assertions of the claim hold. □ 

5.3. Phase 3. This phase converts a near sound advice into a perfectly sound advice 

£'[*,*] all of whose rows are either the correct codewords or the fail symbol _L. The following 
claim is a formal statement of this fact. 

Claim 5.5. Conditioned on the event that the assertions of Claim 5.4 hold, for each s G 
Usuccess, E[s, *] = C[s, *] and \Usuccess\ > {I - ^2 + 3e)n2. 

Proof. For each s G [712], we claim that C[s,*] is the unique codeword satisfying 

Clearly, this implies that for each row s G Usuccess, we decode the correct codeword, i.e., 
E[s, *] = C[s, *]. For the sake of contradiction, let us suppose there exists c 7^ C[s, *] satisfying 

^{(^[Tsuccess], Dl^SjTguccess]) ^ £n\. 

By triangle inequality, we can conclude 

^{p\Esuccess\i C\s .iTsuccess\') ^ '2£Tl\. 

Further from claim 5.4, \Tfaii\ ^ [ni] — \Tsuccess\ ^ (^i — 3e)?ii. This implies 

A(c, C[s, *]) ^ 2eni + {5i - 3e)ni < diui. 

This is a contradiction since c and C[s, *] are two distinct codewords of Ci that are less than 
5i apart. 

Let U\ denote the set of rows with less than average fraction of errors; formally, 

Ui = [s£ [n2]|(5(C[s,*],i?[s,*]) 5;r?i} . 

By an averaging argument, for at most 82 — 3e fraction of the rows of C, the distance 
5{C[s, *], i2[s, *]) > r/i, i.e., \Ui\ ^ (1 — J2 + 3e)?i2- For each row s G C/i, we have C[s, *] G Cg- 
From Claim 5.4, for at most eni columns in Tg^ccess^ C[*,t\ 7^ Z?[*,t]. Consequently, for each 

rows G Ui, E[s,*\ /_L, i.e., Ui C Usuccess- HeUCC We get \Usuccess\ ^ \Ul\ ^ (l-(52 + 3e)n2. □ 
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5.4. Phase 4. This is a fairly simple phase where a perfectly sound advice is used to 

completely retrieve the codeword C. Specifically, we make the following claim : 

Claim 5.6. Conditioned on the event that the assertions of Claim 5.5 hold, the algorithm 
TensorDecode outputs the codeword C. 

Proof. By Claim 5.5, we know *] = C[s, *] for at least (1 — ^2 + 3e)n2 rows and E[s, *] =_L 
for the remaining rows. Hence for each column t G [ni], the UniqueDecodeErasures2 algorithm 
returns the codeword C[*,t]. Thus the algorithm TensorDecode returns the codeword 
C. □ 

Theorem 5.7. Civen two codes Ci, C2, for every e > 0, the number of codewords 0/C2 <^ Ci 
within distance rj* = min(5i?72, (^2^1 ) — 3e of any received word is bounded by 

^(C2®Ci,r7*) ^4g^ ^ ^ . 

Further, if Ci and C2 can be efficiently list decoded up to error rates 771,7/2 and C2 is a linear 
code, then C2i8>Ci can be list decoded efficiently up to error rate rj* . Specifically, ifT denotes the 
time complexity of list decoding Ci andC2, then the running time of the list decoding algorithm 



for C2 Ci is Oiiq^'i' ' ' x Tnin2) 

Proof. Rewriting the expression for the probability in Theorem 5.2 using Lemma 5.1, 

2 



e 



Set mi = In ^^^^''^'^ and m2 = t^t In ^^^fe) ^ g^sy to see that the probability of success 

is at least j with this choice of parameters. In other words, with this choice of parameters, 
any codeword C within distance r]* — 3e is output with probability at least | if the initial 
guess A is consistent with C. Hence, the number of codewords within distance rj* — 3e from 
the received word R is 

1 In 8<^l('7l) In 8<^2('72) 

£(C2 0Ci,r?*-3e) ^4g'"i™2 = ' ' . 

It is easy to check that the running time of the algorithm is as claimed above. □ 

Theorem 5.8. Let C be a linear code with distance 6, list decodable up to an error rate r]. 
For every e > 0, the m-wise tensor product code C®^ can be list decoded up to an error rate 



^I'n- — £ with a list size exp ( 



Proof. Applying Theorem 5.7 with Ci = C2 = C, we get list size bound at error rate 6r] — 3e for 
C(dC. Applying theorem again on C®^, we get list size bounds at error rate (5^ x {6i] — 3e) — 3e = 
S^T] — 36^e — 3e. In general, for C*^^*" let ?72fc denote the error rate at which we obtain a list 
size bound. Then, 

k „ 
r]2k = 77 — 3e> >o 7 



1-52 

i=0 
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For brevity, let us denote by the list size i.{C®'^^ ,ri2k). Then from Theorem 5.7, we have 
the following recursive inequality: 



ln5fc+i ^ TTo-^ln hln4 



Rewriting the above inequality, 

In^^l^l-^ — + l-4 + lnl^i^ln^l^ 

£ A5^£^ £ £ 26^£^ £ 

Set Sfc = ln4S'fc/e and a = jp^- Then we have the recurrence relation : 

Sk+i ^ a • Sfc So = In ■ 



£ 



Thus we get Sfc ^ (ln4£(?7)/e)2 a2 -1 < (aln4^(ry)/e)2". Hence we obtain the following list 
size bound for m = 2^ . 

C^-,5-S- < exp ( 



l_52y--^^V 252^2 

Rewriting the above expression with £ in place of jz^^'i 

' '91nglnl2£(?7)/e(l - 5^ 



^(C^™,5"^-^7?-e) < exp 



2^2(1 _J2)2£2 



□ 



6. Improved List-size Bounds via Generalized Hamming Weights 

In this section, we prove improved list-size bounds on tensor products and interleavings of 
binary linear codes. This is done by making a connection between the weight-distributions of 
such codes and the classical coding theoretic notion of Generalized Hamming Weights. This 
allows us to use the Deletion technique of [12] to reduce the problem of bounding list-sizes to 
the low-rank case. 

We start by introducing the version of the Deletion Lemma that we need. It is a mildly 
stronger version of the deletion lemma from [12], the graph theoretic view was proposed by 
Impagliazzo. 

Lemma 6.1 (Deletion lemma). [12] Let C C F^' be a linear code over ¥q. Let C C be 
a (possibly non-linear) subset of codewords so that c' G C iff —c' G C , and every codeword 
c £ C\C' has wt(c) ^ fi. Let i] = Jq{n) — 7 for 7 > 0. Then 

Proof. Let r G denote a received word, and let C = {ci, . . . , ct} be the list of all codewords 
in C so that A(r, Cj) ^ t]. Construct an (undirected) graph G where V{G) = {ci, . . . , q} and 
{i,j) is an edge if Cj — cj £ C. Our goal is to bound |y(G)|. 

We claim G does not have large independent sets. Let / = {ci, . . . , c^} be an independent set. 
This means that for every i / J G [s], Q — Cj C so A(cj, Cj) ^ /.t. But every codeword in / 
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lies within distance rj of r. We now invoke the Johnson bound which states that in a code of 
distance fi, the hst-size at radius Jq{n) — 7 is bounded by 7"^. This shows that a{G) ^ 7"^. 

We claim that the degree of G is bounded by i{C' , rj). Suppose that a vertex c has d neighbors 
{ci, . . . , Cd}- They can be written as c + c'^, . . . , c + c'^ where c' £ C . Since 

A(c + c^,r) = A(c^,r — c) ^ r] 

the codewords c[, . . . ,c'^ give us a list of codewords in C at distance t] from the received word 
r — c. Hence d ^ i{C',r]). 

Thus G has degree d(G) ^ £{C',7]) and the max independent set size a{G) ^ 7~^- Thus 

\G\^a{G)d{G)^^-^e{C',rj). 

□ 

The Deletion lemma of [12] corresponds to taking C to be all codewords for weight less than 
H, and using £{C',r]) ^ \C'\. However, in our applications \C'\ will be too large for this to be a 
useful bound, thus we essentially use the Deletion lemma as a reduction to the low-rank case. 

Generalized Hamming Weights (GHWs) arise naturally in the context of list-decoding from 
erasures [13]. For a vector v of length n, Supp(i)) C [n] denotes the co-ordinates where Vi 7^ 0. 
For a vector space V, Snpp{V) = Uy^v Supp('y). 

Definition 6.2. The r^^ generalized Hamming weight of a linear code C C denoted by 
5r{C) is defined to be | Supp(Vfc)|/n over all A;-dimensional subspaces T4 of the code C. 

Clearly, 5i{C) = S(C) is just the minimum distance. The following lower bound on 5r{C) which 
is folklore [15, 33], says that as we consider larger values of r, 5r{C) approaches — ryiJ. Thus 
for binary linear codes, 5r{C) approaches 26{C) as r grows. 

Lemma 6.3. For any linear code C C with minimum distance 5{C) and any r ^ 1, 

S,.(C) > ^^(C) (1 - 1) . 

Given a matrix C £ F^^™, let Rank(C) denote its rank, let RowSpan(C) be the space spanned 
by its rows and ColSpan(C) be the space spanned by its columns. We use the following 
standard fact from linear algebra: 

Fact 6.4. Given C £ ¥'^^'^ such that I{ank(G) = r, let {vi, . . . ,Vr) be a basis for KowSp&n^G) . 
Then we can write 

r 

C = ^ Us (g) Vs 
s=l 

for some vectors {ui, . . . , Ur} which form a basis for ColSpan(C). 

6.1. Interleaved Codes. In this subsection C is a binary linear code. We i^"^{r]) to denote 
rj). We use C®"* to denote the sub-code of C®™ consisting of codewords of rank at most 
r, and if"^{rj) for ^(C®™', rj). The following lemma relates the rank of a codeword to GHWs. 

Lemma 6.5. Given C E C®*" such that Rank(C) = r, wt(C) ^ 6r{C). 
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The lemma holds since dim(ColSpan(C)) = r hence its support is at least 6r{C). We now 
apply the deletion argument to reduce the problem of bounding the list-size to the low-rank 
case. 

Lemma 6.6. Let C be a binary linear code and let r = [log ^] . Then for any rj ^ 6, we have 
Proof. It is easy to check that 

6r > 25(1 - 2-^) > 2(5- J^ 

x2 

J{Sr) = 1- y^l-5r> 6+—. 

The bound on J{6r) may be verified by observing that 

j~\S + ^) = 26-5^-^<6r. 

Let C consist of all codewords C where Rank(C) ^ r so that we can take fi = 6r- Since 
J(/z) ^ 6 + 6'^ /2 whereas ^ 5, we have 

7 = ^(/^) - ?? = (<5 - ??) + y ^ y • 
Applying the Deletion Lemma 6.1, we obtain the desired conclusion i®"^{vi) ^ D 

An immediate corollary of Fact 6.4 is 

Corollary 6.7. Given a codeword C EC®™ ofrankr, /ei {6[1], . . . , 6[r]} 6e tesis /or RowSpan(C). 

Then C can be written as C = Yll=i "X" h[s\ where Cs £ C for s £ [r]. 

Our goal is to reduce the low-rank problem for interleaved codes to the case when m = r, 
by fixing a basis for the row-space. The following lemma narrows the choices for the basis 
elements to rows that have reasonably large weight. 

Lemma 6.8. Let e > and let rj = 5 — e. Let C € C®"* be a rank r codeword and R be a 
received word such that A(i2, C) ^ ry. There is a basis {b[l], . . . , b[r]} for RowSpan(C) where 
wt{C,b[s]) ^ £2^-'' for all s £ [r]. 

Proof. Let 

S = {b£ Span(C) | wt{R, h) ^ £2^~\ b / 0"*}. 

We claim that S contains a basis for RowSpan(C), or equivalently dim(S') = r. Assume for 
contradiction that dim(S') = r — 1, and that b[l], . . . ,b[r — 1] is a basis for it. Complete it 
to a basis for Span(C) by adding b[r], and let S' = b[r] + {b[l], . . . ,b[r — 1]). Note that <S" is 
disjoint from S. 

By Corollary 6.7, we can write C = Ylt=i'^t ^W- ^r[i] 7^ 0, then C[i] € S'. Since 
wt{cr) ^ Sn, it follows that 6n of the rows of C lie in S'; that is wt(C, S') ^ 5. Since 
A(C, i?) ^ T], we have wt(i?, 5') ^ e. But since \S'\ = 2^~^, there must exist b £ S' so that 
wt{R,b) ^ e2^~^ . Since S' is disjoint from S, this contradicts the definition of the set 5. □ 
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Lemma 6.9. Let e > and let rj = 6 — e. Set r = [log . We have 

Proof. By Lemma 6.6, it suffices to bound ^®'"((5 — e) where r = We fix the choice of 

2 

basis {b[l\, . . . , b[r]} for RowSpan(C). Lemma 6.8 shows that there are at most 2*" e~'^ choices 
for the basis. We then map R:W^^¥^ toa received word R' :¥^^ {F^ U ★} as follows: 

if R{x) Span(C) 



R'{x) 




Every C' satisfying A{R' , C) ^ 77 is in on-to-one correspondence with C so that A(i?, C) ^ r] 
and Span(C) = {b[l], . . . , b[r]). So the number of such codewords is bounded by i®^{r]). □ 

Clearly, i®^{ri) ^ ^ivY ■ Plugging this into Lemma 6.9 gives Theorem 2.6. Further improve- 
ments on this bound are possible using the analysis of Theorem 3.6 combined with better 
list-size bounds for decoding binary codes from erasures; we present them is Section 6.3. 

The only step which needs g = 2 is Lemma 6.6, where we choose r large enough so that 
J2{^r) ^ ^- This does not have an analogue over Fg since 6r may only increase by a factor 
q/{q — 1), hence there may not be an r such that Jq{6r) > S. But the step of bounding the 
number of small-rank codewords works for any field. 



6.2. Tensor Products. Li this subsection, Ci and C2 are binary linear codes. We use Si^r 
to denote the r*^ generalized weight of Ci. We use ii{T]) for £{Ci,r]) and £'^{ri) for £{€2 Ci, r]) 
and £f{ri) for the list-size when we restrict ourselves to codewords of rank at most r. 

The following lemma relates the weight-distribution of tensor product codes to the generalized 
Hamming weights of Ci and C2. While the lemma is straightforward, we have not found an 
explicit statement in the literature. 

Lemma 6.10. Given C G C2 Ci such that Rank(C) = r, wt(C) ^ 25i52{l - 2"*^). 

Proof. The column rank of C is r, thus | Supp(ColSpan(C))| ^ 52,rn2- Each of these indices 
corresponds to a row from Ci with weight thus overall the codeword has weight at least 

52,rhn2'ni ^ 5i52(l - 2~'')nin2. □ 

If we let wtr denote the minimum weight of a rank r codeword, we have wt^ ^ 2(^i52(l — 2"''). 
We now show a reduction to the low-rank case for tensor products. 

Lemma 6.11. Set r = log(jj^). Then for any r] ^ S162, 

Proof. We claim that for any 5 ^ ^, J2{25 — 5^) > 6 + 5^/2. To prove this, observe that 
J-\6) =26{l-6). Hence 

J2~^(5 + (5V2) = 2(5 + 52 -252(1 + 5/2)^ < 26-6^. 
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Lemma 6.10 shows that for r = log(j^) any codeword C with Rank(C) = r has weight at 
least 

wir = 25i(52(l - 2-") ^ 25i52 - 5l5l. 
We apply the Deletion lemma taking C to be all codewords of rank at most r, with ^ = 
— ^f^l- Since J(/i) > 8182 + (^i(^|/2, whereas r/ ^ 8182, we can take 7 = (5f(5|/2, which 
gives the desired bound. □ 

A corollary of Fact 6.4 for tensor product codes is: 

Corollary 6.12. Let C G C2®Ci he a codeword of rank r, and let {vi, . . . ,Vr) = RowSpan(C). 
Then C can be written as C = Yl^s=i ® where {ui, . . . ,Ur) = ColSpan(C). 

Fix a received word R, which we wish to decode from r]* — e fraction of error where rj* = 
m.m{rji82,ri28i) and e > 0. By decoding each row up to radius r/i, we get lists £1, . . . ,£„2 of 
codewords from Ci each of size at most ti {rji ) . By decoding each column up to radius r/2 , we 
get lists C'l, . . . ,C'ni of codewords from C2 each of size at most £2(112) ■ The following lemma 
gives an analogue of Lemma 6.8, restricting the choice of basis vectors to those that occurs 
relatively frequently among the lists. 

Lemma 6.13. Let C G C2 iX" Ci be a rank r codeword such that A{R,C) ^ ij* — e. There is 
a basis V = {vi, . . . , Vr} for RowSpan(C) where each Vi occurs in at least e2^~^n2 of the lists 

Proof. Consider the set of codewords S = {v G Ci} which occur in the row lists at least 
e2^~''n2 times. We claim that S contains a basis for RowSpan(C). Assume for contradiction 
that it only spans an r — 1 dimensional subspace. Choose {wi, . . . ,Vr-i} which form a basis 
for it and complete it to a basis by adding Vr- Define the set 5' = w,. + (vi, • • • , Wr-i)- Now by 
Corollary 6.12, we can write C in the form C = X]s=i '^s ® Vs for some ui, . . . ,Ur £ S which 
span ColSpan(C). 

Note that wt{ur) ^ 82, hence at least (52712 rows (corresponding to indices in the support of 
Ur) come from the set S' , call this set A C [712]. Since the error rate is rj* — e ^ 82rji — e, 
it must be the case that for some subset S C A of rows where |-B| ^ en2, the error rate on 
those rows is less than r/i; else the overall error rate is at least {82 — e)?7i > 82'r]i — e ^ rj* — e. 
List decoding rows in B up to radius rji recovers the corresponding row vector from C. So a 
vector from S' occurs in all lists for rows in B. Hence one of these vectors has to occur with 
frequency at least e2^~''?i2, but this contradicts the fact that S and S' are disjoint. □ 

Similarly, let T denote the set of vector u G C2 which occur in in at least e2^~^'ni of the lists 
C i. One can show that T contains a basis U = {ui, . . . , u^} for ColSpan(C). 

Lemma 6.14. We have 

ef{v'-e)^2^'%-'^h{rj,Yi2{r,2r. 

Proof. Note that |5| ^ 2'^~^ii{rii)e^^ , and \T\ ^ 2''~-'^£2(??2)e~^- We choose r basis vectors 
from S and T respectively as bases for RowSpan(C) and ColSpan(C), for which there are 
at most |5|''|T|'' choices. We then choose r row vectors {vi, . . . ,Vr} from RowSpan(C) and 
r column vectors {ui, . . . , Ur} from ColSpan(C) so that C = X]s=i '^i ® This gives 2^'' 
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additional choices. Thus (.2 {if — can be bounded hy 2^"" \SY\T\'', which gives the desired 
bound. □ 

Putting together Lemmas 6.14 and 6.11, we have proved the following theorem. 

Theorem 6.15. Let r = [log(^^^)], 772 ^ ^ ^2 o,nd rf = min(7/i52) ??2'^i)- Then there 

exist constants ci,C2 so that for any e > 0. 



-e)^ci2^2^'£i(7?ir£2(r?2re-'^ 

6.3. Further Improvements for Interleaved Codes. Theorem 2.6 was proved by first 
reducing to the rank r case for constant r, then reducing to m = r by fixing a basis, and 
using the trivial upper bound l®^{r]) ^ (-{riY ■ One can improve on this last bound using the 
analysis of Theorem 3.6 combined with better list-size bounds for decoding binary codes from 
erasures. 

Theorem 6.16. For any binary linear code C of relative distance 5, let r = [log p-] . For any 
T] < 6 



For a binary error correcting code with relative distance 5 and fi ^ rj < 6, we let /(^,?7 — /i) 
denote the list-size for C with /i erasures and r] — errors. 

Lemma 6.17. For any n ^ rj, we have 

/(/i,r?-^) ^2-^(r?-^/2). 



Proof. Let r G {0, 1, be a received word where wt(r, ★) ^ fi. Let C = {ci, • • • , Ck} denote 
all codewords of C that satisfy A(cj, r) ^ r/. Set the erased positions of r at random in {0, 1}, 
call this received word r'. Then Pr[A(r',Ci) ^ t] — ^/2] ^ ^. Thus, in expectation over the 
settings of the erased bits, k/2 of the codewords from C satisfy A(r',Ci) ^ ry — ///2. Fixing 
one such choice of r: 

□ 

The following lemma completes the proof of Theorem 6.16. 

Lemma 6.18. Let C be a binary code with distance 5. Then for any r] < 5 we have 
(6.2) i^^v) ^ rr - ^^^) • 



Proof. We run Algorithm 1 on the received word R and build a tree. We mark each edge 
leaving v Blue if it lies within the unique-decoding radius (for the code being decoded at 
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v) and Red otherwise. A simple induction shows that after k Red edges, we have at least 
5{1 — 2~^) erasures. Thus, after k Red edges, the degree of the tree drops to 

(5(1-2"^) 
2 

Solving the recursion for the number of leaves shows that 



3'-(r/)^r2^n^(^-^ii^) 



□ 



Equation 6.2 allows us to replace (-{riY term in Equation 2.3 with the product i{ri)i{r] — 
5/4)l{rj — 35/8) • • • . This advantage is pronounced if the list size decreases rapidly as the 
radius shrinks; which happens if we use the Johnson bound to bound the list-size. We finish 
the proof of Theorem 2.7. 

Proof of Theorem 2. 7. 

Set 7? = J2(<5) — e. Note that 6 — r] 6 — J2{S). Let r = [log ^] . Using Lemma 6.9, we get 

(6.3) £®™(J2(5) -s)^ ^__^__£0-(j2(5) -e)= csi^^{J2{d) - e) , 

for some constant cg that only depends on 5 < 1/2. Applying Lemma 6.18, we conclude that 

r-l 



i'^'\J2{6)-e)^r2'll£{J2{6)-jk) 



k=0 

where = — — + £• The Johnson bound states that i{J2{S) — 7) ^ 7~^- Foi' A; ^ 1, we 
can lower bound 7^ by ^ — —■ Hence we have 

(6.4) P^{J2{S) -e)^ rre-^ J] , ^ c',e~\ 

k=l ^ ' 

Combining Equations 6.3 and 6.4 gives the desired result. □ 



7. List Decoding Linear Transformations 

In the previous sections, we have developed techniques for decoding generic interleaved codes 
based on generalized Hamming weights and decoding from erasures. These can be applied 
to get sharper bounds for specific families of codes. We illustrate this in the case of linear 
transformations. 

The problem of list decoding linear transformations is equivalent to list decoding interleaved 
Hadamard codes. Equivalently, one can think of the message as a matrix M over Fg of 
dimension k xm, encoded by the values x^AI for every x G F^'. Thus the encoding is a matrix 
C of dimension x m where each column is a codeword in the Hadamard code. Recall the 
well known list size bound £{1 — - — e) ^ 0(e~^) for Hadamard codes over F^. 
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Consider the space Lin(F|j, k, m) of all linear transformations L : F^' — > F™. We use £®™(Hadg, rj) 
instead of to denote the denote the list-size for Lin(Fq, k, m) at distance t]. We let Linr(Fg, k, m) 
denote the space of all linear transformations of rank at most r in Lin(Fg,A;,m) and let 
^^"^(Hadq, ry) denote the list-size for Linr(Fq, /c, m). 



7.1. Linear Transformations over F2. We first use the Deletion lemma to reduce the 
task of proving list-size bounds for Lin(F2, k, m) to that of proving bounds for Lin2(F2, fc, m). 
In this subsection, we use ^®"^{rj) to denote ^®'"(Had2, rf) and li®'"^{rf) to denote (Had2 , ??) . 

Lemma 7.1. For any rj < 1/2, we have ^ C£^'"(??) where C is an absolute constant. 



Proof. We take C to consist of all linear transformations of rank at most 2. Thus for any 
codeword L ^ C' , Rank(L) ^ 3, hence wt(L) ^ |, so we take /x = |. Since J(7/8) > 0.65 
and ?7 < ^, we can apply the Deletion lemma with 7 = 0.15 to conclude that i®"^{ri) ^ 



To bound £^'"(77), we start by bounding ^f"^{r]), which is the maximum number of rank 1 
linear transformations within distance rj of the received word R. We need some facts about the 
structure of L{x) when L is a linear transformation of rank r. Recall that we use ^[1], . . . , l[k] € 
for the rows of L, thought of as vectors from F™. Let RowSpan(L) C Fg^ denote the r- 
dimensional space spanned by these vectors. We write RowSpan(L) = . . . ,l'[r]) to 

denote the fact that . . . , l'[r] form a basis for RowSpan(L). For a function r : F^' ^ F™ 
and a vector v E F™, we define wt{R,v) = Prx[R{x) = v]. 

Lemma 7.2. For L G Lin,.(-F25 k, m) and any v £ RowSpan(L), wt(L, v) = . 



Proof. The j*^ column of L defines a linear function lj{x) = x^lj from F2 ¥2. We 
have L{x) = {li{x), . . . ,ltix)). Let us pick a basis for the columns, assume that this ba- 
sis is li,...,lr- Let v<^r denote the projection of v onto the first r co-ordinates. We have 
Prri;[{h{x), . . . , lr{x)) = v^r] = 2"*", which implies the claim. □ 



7.1.1. Bounding rank 1 linear transformations. The above lemma implies, in particular, that 
if Rank(L) = 1 and RowSpan(L) = (/'[I]), then wt{L,l'[l]) = wt(L,0'=) = i. 

Corollary 7.3. Let L G Lini(F2, A;, m) and R : ¥^ ^ ¥f be such that A{R,L) ^ ^ - e. // 
RowSpan(L) = (/'[I]), then wt{R,l'[l]) ^ e. 



This narrows the choice of basis vectors for RowSpan(L) to at most ^ vectors where wt(L, /'[I]) ^ 
e. Once we fix /'[I], the problem reduces to Hadamard decoding (or rather m = 1). Given 

{F2 U ★} as follows: 



R : F^ 



F™, we define r : Fg 



r(x) 



if R{x) = O'", 

1 if = 

-k otherwise. 



Setting r{x) = -k denotes an erasure; since R{x) RowSpan(L), we know there is an error at 
index x. Given a Hadamard codeword I ■.¥\ ^ F2, if we define L G Lini(F2, /c, m) by reversing 
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the substitution: 

fr[l] if/(x) = l, 
^ ' [O'" if /(x) = 

then it follows that A(i?, L) = A(r, I). This proves the following claim. 

Lemma 7.4. The linear transformations L S 'Lviii[F2,k,m) so that A{R,L) < ^ — e and 
RowSpan(L) = (/'[I]) are in one-to-one correspondence with Hadamard codewords I so that 
A{r,l) < i-e. 

Since there can be at most O(p-) codewords of the Hadamard code within distance ^ — e, 
and at most ^ choices for this suffices to prove a bound of 0{\). We can improve this 
to O(^) by observing that if there are many choices for I' [I], then each of them is likely to 
result in fewer codewords. This relies on the following Lemma about Hadamard decoding with 
erasures. 

Lemma 7.5. Given r : ^ {F2 U *} so that wt(i2,*) ^ rj, the number of codewords I such 
that A(r, /) < e is bounded by (^^_^2e)'^ ' 

Proof. We use the well known fact that the number of codewords so that A(r, /) < | — e is 
bounded by So assume there are s codewords Zi, . . . , Z^. Consider setting each erasure to 
a random value in F2. For any I, with probability ^ we have A{r,l) ^ k ~ 2 ~ ^' 'Thus in 
expectation, | of the codewords will now satisfy A(/,r) < \ — ^ — £■ Fix one such setting of 
the erasures. But now we have 

s 1 2 

2 ^ (r? + 2e)2 ^ ^ ^ lji+2e^' 

□ 

We can now show an 0{e~'^) bound on rank 1 transformations. 
Lemma 7.6. For any e > 0, we have 

Proof. Assume that the condition wt(i?, /') > e holds for t non-zero row vectors /' G F^. Fix 
one such choice of I' . We then erase r] ^ (t — l)e positions, so by Lemma 7.5 the number of 
candidates is no more than ^^q:^^. Thus overall the list-size is bounded by ^^^^^^^.-j: ^ 2e^- ^ 

7.1.2. Bounding rank 2 linear transformations. We now proceed to analyzing the list-size for 
rank 2 linear transformations. We begin with the analog of Corollary 7.3 for the rank two 
case. 

Corollary 7.7. Let L G Lin2(F2, k, m) and R:¥^^¥^ be such that A{R, L) ^ i - e. There 
is a basis {u,v} /or RowSpan(L) such that wt{R,u) and wt{R,v) are both at least |. 

Proof Note that wt(L, wt(L, /[2]) and wt(L,Z[l] + l[2]) are all exactly i. If two of the 
symbols {Z[l], Z [2], /[!] + / [2]} have weight less than | in i?, then A (i?, L) > 2(i-§) = i-e. □ 
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In fact, we can assume that wt{R,u) ^ ^ and wi{R,v) ^ |, but we will not use this claim. 
Once we pick u and v we reduce to the case m = 2 by defining r : — > {F2 U as follows: 

(A, fi) if R{x) = An + fiv, 
-k otherwise. 



r(x) 



We have the following analogue of Lemma 7.4. 

Lemma 7.8. The linear transformations L G Lin2(F2, fc, m) so that A{L,R) < ^ — e are in 
one-to-one correspondence with linear transformations I G Lin2(F2, k, 2) so that A{1, r) < \ — e. 

Lemma 7.9. Let r : F2 — > {F2 U*} he such that wt(r, ^ 77. The number of linear transfor- 
mations I e Lin2(F2,fc, 2) so that A(Z,r) < ^ — e is hounded hy (^^^2e)'^ ' 

Proof. Let I = (hjh) where each /j : F2 ^ F2. We proceed as in Theorem 2.5, erasing errors 
whenever they are located. We run Hadamard decoding on the first column which gives a list 
-^1 of {r)+2e)'^ candidates for £1. We then choose c G /^i, erase the positions where it differs 
from R and list decode the resulting word to get candidates for 12- 

At most one codeword c G £1 can have error rate less than |. If we choose li = c, we have 
no more than ^^^^^^ choices for the second column. The other codewords in £1 which are 

beyond distance | result in r/ ^ | erasures, hence a list-size of C ^ 32 for the second column. 
So the overall list size is bounded by 

2 2 100 

1 • . . „ .0 + . . „ .0 • 32 ^ 



(r? + 2e)2 (r? + 2e)2 ^ {r] + 2ey 

□ 



We are now ready to bound the number of rank 2 transformations. 
Lemma 7.10. There is a constant C so that for any e > 0, we have 



Proof. Assume there are i ^ 2 non-zero vectors v G F™ so that wt(?;) ^ |. By Corollary 7.7, 
each there is a basis {u, u} of L where wt(i?, u) and wt(i?, v) are both at least |. Once we fix 
this basis, we erase all positions containing vectors other than u, v and u-\- v. This results in 
rj ^ {t — 3)e erasures. Thus by Lemma 7.9, the number of L such that RowSpan(L) = (u, v) 
is bounded by 

100 ^ 100 

{ri + 2e)2 ^ (t- 1)2^2 • 

Since there are (2) choices for {u, we get an overall bound of 

t\ 100 _ t 100 100 
2) ' (t- l)2e2 - 2(t- 1)7^ ^ 1^ 

If there are t < 2 nonzero vectors f G F2* with wt(t;) ^ |, then it corresponds to the rank one 
case. Thus, adding bounds for rank 1 and rank 2 cases, we have — e) ^ ^ -|- = 

^. ' ' □ 
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7.1.3. Final list-size bound. Combining Lemma 7.10 with Lemma 7.1, we conclude that for 
some absolute constant C > 0, 

thus completing the proof of Theorem 2.9. 

7.2. Linear Transformations over F^. In this subsection, we use i^"^{r]) to denote 
£®™(Hadg, 1]) and ^®"*(r/) to denote if {Hadg , rj) . One can repeat the above proof of Theorem 
2.9 from the previous subsection for any ¥g, and show that for some absolute constant c, 

^®"^(Had„ 1-l/q-e) . 

This gives an asymptotically tight bound for constant q. In this subsection, our goal will be 
to remove the dependence on q. We will do so at the expense of a worse dependence (of e~^) 
of the bound on e. 

We begin with a deletion argument analogous to Lemma 7.1. The difference is that we only 
delete rank 1 transformations, and the multiplicative overhead is 0{e~'^) as opposed to an 
absolute constant. 

Lemma 7.11. Forr] = ^-e, we have £®™(?7) ^ ^^f™(r?). 

Proof. We take C to consist of all linear transformations of rank at most 1. So we can take 
ji = 1 — -^2 and J{fJ.) = 1 — |. Thus we can apply the Deletion lemma with rj = e to conclude 

that £®™(r?) < ^if^iv)- □ 

Given v £ we now define wt{R,v) = X^^^jr* -P^'xeF™ [-^(^) = M^]- We have the following 
g-ary analogue of Corollary 7.7. 

Corollary 7.12. Let L G Lini(Fg, k, m) and i? : F^ ^ F™ 6e such that A{R, L) ^ 1 - | - e. 
//RowSpan(L) = (v) then wt{R,v) ^ e. 

Fixing the vector v (from amongst t ^ choices), reduces the problem to Hadamard decoding 
over ¥q. The Johnson bound states that at radius 1 — | — e, the list-size is bounded by p-. 
This gives the following bound. 

Lemma 7.13. For any e > we have (.f^[l - | - e) ^ p-- 
Plugging this into Lemma 7.11 completes the proof of Theorem 2.10. 
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